NSF PAR Search | NSF Public Access Repository

Constrained Reinforcement Learning for Fair and Environmentally Efficient Traffic Signal Controllers

https://doi.org/10.1145/3676169

Haydari, Ammar; Aggarwal, Vaneet; Zhang, Michael; Chuah, Chen-Nee (March 2025, ACM Journal on Autonomous Transportation Systems)

Traffic signal controllers (TSCs) play a crucial role in managing traffic flow in urban areas. Recently, reinforcement learning (RL) models have received great attention for TSC, with promising results. However, these RL-TSC models still need to be improved for real-world deployment due to limited exploration of different performance metrics such as fair traffic scheduling or air quality impact. In this work, we introduce a constrained multi-objective RL model that minimizes multiple constrained objectives while achieving a higher expected reward. Furthermore, our proposed RL strategy integrates the peak and average constraint models into the RL problem formulation with maximum entropy off-policy models. We applied this strategy to a single TSC and a network of TSCs. As part of this constrained RL-TSC formulation, we discuss fairness and air quality parameters as constraints for the closed-loop control system optimization model at TSCs, called FAirLight. Our experimental analysis shows that the proposed FAirLight achieves good traffic flow performance in terms of average waiting time while being fair and environmentally friendly. Our method outperforms the baseline models and allows a more comprehensive view of RL-TSC regarding its applicability to the real world.
Full Text Available
Fast Estimation of Globally Optimal Independent Contact Regions for Robust Grasping and Manipulation

https://doi.org/10.1109/Humanoids65713.2025.11203052

King, Jonathan P; Ahluwalia, Harnoor; Zhang, Michael; Pollard, Nancy S (September 2025, IEEE)

Full Text Available
Simple linear attention language models balance the recall-throughput tradeoff

Arora, Simran; Eyuboglu, Sabri; Zhang, Michael; Timalsina, Aman; Alberti, Silas; Zou, James; Rudra, Atri; Ré, Christopher (July 2024, Proceedings of the 41st International Conference on Machine Learning)

Full Text Available
Deep Learning and Symbolic Regression for Discovering Parametric Equations

https://doi.org/10.1109/TNNLS.2023.3297978

Zhang, Michael; Kim, Samuel; Lu, Peter Y.; Soljačić, Marin (January 2024, IEEE Transactions on Neural Networks and Learning Systems)

Full Text Available
Propagating Knowledge Updates to LMs Through Distillation

Padmanabhan, Shankar; Onoe, Yasumasa; Zhang, Michael JQ; Durrett, Greg; Choi, Eunsol (December 2023, Advances in Neural Information Processing Systems)

Modern language models have the capacity to store and use immense amounts of knowledge about real-world entities, but it remains unclear how to update such knowledge stored in model parameters. While prior methods for updating knowledge in LMs successfully inject atomic facts, updated LMs fail to make inferences based on injected facts. In this work, we demonstrate that a context distillation-based approach can both impart knowledge about entities and propagate that knowledge to enable broader inferences. Our approach consists of two stages: transfer set generation and distillation on the transfer set. We first generate a transfer set by prompting a language model to generate continuations from the entity definition. Then, we update the model parameters so that the distribution of the LM (the student) matches the distribution of the LM conditioned on the definition (the teacher) on the transfer set. Our experiments demonstrate that this approach is more effective at propagating knowledge updates than fine-tuning and other gradient-based knowledge-editing methods. Moreover, it does not compromise performance in other contexts, even when injecting the definitions of up to 150 entities at once.
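The distillation stage described in this abstract can be illustrated with a minimal sketch. It assumes per-token logits are already available for the teacher (the LM conditioned on the entity definition) and the student (the same LM without the definition) on one transfer-set continuation. The function names, the toy logits, and the choice of KL(teacher || student) as the matching objective are assumptions made for illustration; the abstract does not specify them.

```python
import math


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def distillation_loss(teacher_logits, student_logits):
    """Mean per-token KL(teacher || student) over one continuation.

    Hypothetical objective: the KL direction is an assumption, not
    something stated in the abstract.
    """
    per_token = [
        kl_divergence(softmax(t), softmax(s))
        for t, s in zip(teacher_logits, student_logits)
    ]
    return sum(per_token) / len(per_token)


# Toy example: two token positions, vocabulary of size three.
teacher = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]  # LM conditioned on the definition
student = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]   # LM without the definition
loss = distillation_loss(teacher, student)
```

In this sketch the loss is zero when the student reproduces the teacher's distributions and positive otherwise, so minimizing it over the student's parameters pulls the unconditioned model toward the definition-conditioned one.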
Full Text Available
Simple Hardware-Efficient Long Convolutions for Sequence Modeling

Fu, Daniel Y.; Epstein, Elliot L.; Nguyen, Eric; Thomas, Armin W.; Zhang, Michael; Dao, Tri; Rudra, Atri; Ré, Christopher (July 2023, Proceedings of the 40th International Conference on Machine Learning (ICML))

Full Text Available
Can LMs Learn New Entities from Descriptions? Challenges in Propagating Injected Knowledge

Onoe, Yasumasa; Zhang, Michael J.Q.; Padmanabhan, Shankar; Durrett, Greg; Choi, Eunsol (January 2023, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
Differentially Private Map Matching for Mobility Trajectories

https://doi.org/10.1145/3564625.3567974

Haydari, Ammar; Chuah, Chen-Nee; Zhang, Michael; Macfarlane, Jane; Peisert, Sean (December 2022, ACM)
The Hazy and Metal-rich Atmosphere of GJ 1214 b Constrained by Near- and Mid-infrared Transmission Spectroscopy

https://doi.org/10.3847/1538-4357/acd16f

Gao, Peter; Piette, Anjali A.; Steinrueck, Maria E.; Nixon, Matthew C.; Zhang, Michael; Kempton, Eliza M.-R.; Bean, Jacob L.; Rauscher, Emily; Parmentier, Vivien; Batalha, Natasha E.; et al (July 2023, The Astrophysical Journal)

The near-infrared transmission spectrum of the warm sub-Neptune exoplanet GJ 1214 b has been observed to be flat and featureless, implying a high metallicity atmosphere with abundant aerosols. Recent JWST MIRI Low Resolution Spectrometer observations of a phase curve of GJ 1214 b showed that its transmission spectrum is flat out into the mid-infrared. In this paper, we use the combined near- and mid-infrared transmission spectrum of GJ 1214 b to constrain its atmospheric composition and aerosol properties. We generate a grid of photochemical haze models using an aerosol microphysics code for a number of background atmospheres spanning metallicities from 100 to 1000× solar, as well as a steam atmosphere scenario. The flatness of the combined data set largely rules out atmospheric metallicities ≤300× solar due to their large corresponding molecular feature amplitudes, preferring values ≥1000× solar and column haze production rates ≥10⁻¹⁰ g cm⁻² s⁻¹. The steam atmosphere scenario with similarly high haze production rates also exhibits sufficiently small molecular features to be consistent with the transmission spectrum. These compositions imply that atmospheric mean molecular weights ≥15 g mol⁻¹ are needed to fit the data. Our results suggest that haze production is highly efficient on GJ 1214 b and could involve non-hydrocarbon, non-nitrogen haze precursors. Further characterization of GJ 1214 b’s atmosphere would likely require multiple transits and eclipses using JWST across the near- and mid-infrared, potentially complemented by ground-based high-resolution transmission spectroscopy.
Full Text Available
Impact of Deep RL-based Traffic Signal Control on Air Quality

https://doi.org/10.1109/VTC2021-Spring51267.2021.9448639

Haydari, Ammar; Zhang, Michael; Chuah, Chen-Nee; Ghosal, Dipak (April 2021, 2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring))
Full Text Available
